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Pneumonia affects so many people around the world, which leads to many of 
them being killed. In order to identify and diagnose the disease, the patient 
must first undergo an x-ray scan of the chest (CXR). Disease will be identified 
according to the CXR images. Software diagnostic tools are used to help 
decision-making and to promote the pneumonia diagnosis process. From these 
tools is the gray level distribution moments (GLDM) algorithm which is used 
for CXR image features extraction, and we used the meta-heuristic algorithm 
representing the basic butterfly optimization algorithm using lèvy flight 
(BOALF) and the modified butterfly optimization algorithm (BOARN) to 
detect pneumonia on the basis of these extracted features. And then we've also 
been making hybrid between the BOA with fuzzy membership function to get 
as novel method called it fuzzy butterfly optimization algorithm (FBOA). 
These methods were based on various x-ray images of the chest. In testing 
phase, the proposed method obtained the highest diagnostic rate of the disease 


compared to the other two methods in this work. 


This is an open access article under the CC BY-SA license. 





Corresponding Author: 


Baydaa I. Khaleel 

Department of Computer Science, College of Computer Science & Mathematics 
Mosul University, Iraq 

Email: baydaaibraheem @uomosul.edu.iq 








1. INTRODUCTION 

For many children around the world pneumonia is the cause of death. Two types of causes of 
pneumonia are either bacterial pneumonia, or viral pneumonia. Antibiotics are used to treat bacterial 
pneumonia, while supportive treatment is required for viral pneumonia, which makes the diagnostic accuracy 
very significant. Diagnosis of pneumonia is typically by analyzing x-rays of the chest, that is, by radiographic 
examination [1]. If there's a lack of radiologists, the diagnosis speed would be reflect negatively. The 
discovery of pneumonia is carried out by traditional CXR analysis, and to help doctors and radiologists 
interpret the chest images quickly and accurately, some Artificial Intelligence algorithms are used such as the 
butterfly optimization algorithm [2]. And Figure 1 shows the chest x-ray images, sample of normal, and 
sample for infected for the bacterial pneumonia, and viral pneumonia [3], [4]. 

To facilitate image handling, minimize image data and aim to obtain a subset of essential and 
efficient image data features by extracting the features. To extract the features from the images of the used 
database, an algorithm (gray level distribution moments) was used here. Different features have been 
extracted in this method, namely mean, standard deviation, skew — ness, energy and entropy, which depend 
on the first order graph [5]. 

To detect pneumonia disease, in this work we used one of the swarm intelligence method is a 
butterfly optimization algorithm, it is one of the nature-inspired algorithms that depends on population 
behavior, as it is considered a biological behavior model for finding mating and food source. These 
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algorithms which are inspired by nature will find the best solutions to many different problems [6]-[8]. There 
are many works relating to the use features in chest radiographs for pneumonia detection. Ebenezer J et al, 
proposed firefly based fuzzy c-means for chest x rays images segmentation [1]. X. Wang et al, used deep 
convolution neural network based "reading chest x-rays", to recognize and locate common patterns of disease 
trained with image-level labels only [9]. Yu. Tang et al, built and tested various deep convolution neural 
networks CNN to differentiate between normal and abnormal frontal chest radiographs [10]. D. S. Kermany 
et al, used a pre-trained Inception V3 model as an extractor for fixed features to classifying normal and 
pneumonia infected pediatric (CXR)s and further distinction between viral and bacterial pneumonia with a 
region under the curve AUC of 0.968 and 0.940 respectively then the average accuracy 95.5% [11]. V. 
Chouhan et al, for pneumonia detection, they presented a transfer learning-based approach, they got accuracy 
equal 96.3% [12]. S. H. Yoo et al, they used a deep learning-based decision-tree classifier for chest x-ray 
images classification and covid-19 detection, and the CXR images classified by decision tree got accuracy of 
95% [13]. T. Mahmud et al, they presented a proposed method by using (multi-dilation CNN) for covid-19 
detection, and pneumonia detection and classification [14]. N. Wang et al, used ResNet-101 and ResNet-151 
with effect for fusion for classification of chest x-ray images to three group such as, normal, viral pneumonia, 
and covid-19 achieved accuracy of 96.1% [15]. M. Z. C. Azemin et al, used a deep learning method based on 
the ResNet-101 CNN model to recognize and detect abnormality in the chest x-ray images, this method got 
accuracy 71.9 % [16]. 





(a) (b) 


Figure 1. Chest x-ray images, (a) normal CXR, (b) bacterial pneumonia, (c) viral pneumonia 


This research has been organized as: section 2 presents the proposed method obtained by combining 
the fuzzy membership function with the butterfly optimization algorithm (BOA) to produce a proposed 
algorithm called fuzzy butterfly optimization algorithm (FBOA), section 3 contains the description of the 
gray level distribution moments, section 4 discusses the details of the butterfly optimization algorithm 
(BOA), section 5 includes the performance measurements, section 6 explains the results of the experiments, 
section 7 conclusion. 


2. PROPOSED METHOD 

To improve the work of the butterfly optimization algorithm and to help butterflies develop global 
and local search capabilities and find the best solution. We hybridized between the fuzzy membership 
function and butterfly optimization algorithm. And then we got to a novel method called FBOA, and the 
equation of global and local search after hybrid method became as: 


Xt+1 = Xt + (B; x G* — XŁ) x F; (1) 
Xit! = Xt + (B; x X$ — Xf) x F; (2) 


Where B; is the fuzzy membership function 





B, = (u:;)" (3) 
1 
ij — Ste 
ce (Pi) (4) 
deat (52) 
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Function B; coefficient is based on the fuzzy generator q, which is an actual number larger than one. 
And to improve the work of the proposed algorithm, after adding the membership function to the equations 
for adjusting butterfly positions to find the best solution. Another fitness function was used to ensure optimal 
detection and classification of pneumonia diseases. Where the law or the ratio of disclosure ratio was used 
represented by (5): 


a num. of correctly detected image 





x 100 
total num. of image (5) 


Calculates the detection ratio at each step to ensure the optimum solution in the classification process for x- 
ray images. Where the algorithm continues to run until the best detection ratio is reached. 


3. GRAY LEVEL DISTRIBUTION MOMENTS 
First-order probability distribution It is one simple way for extract the statistical features in the 
image. Estimate first order histogram of p (c) is: 


N(c) 


T (6) 





p (c) = 


Where in an image: M is represent total number of pixels, and c represent a gray level, and number of pixels 
of gray value(c) represented by N (c), where (0 < c < L — 1). And (7) represented how to compute features 
which as extracted by using first order histogram: 


L-1 
Mean: Sy = Ct = >» c p(c) (7) 


c=0 
This feature denoted to the mean of pixel intensities values 
L-1 1/2 
Standard deviation: Sp = 0p = > (c =- 7)? rol (8) 


c=0 
This feature represented the (variance) or standard deviation between the pixels for input image. 


L-1 


1 
Skew — ness: Sç = DIG —c)? p(c) (9) 
© c0 


The Skew — ness is distinguishes degree of symmetry of density distribution around the average density. 
The value of Skew — ness can be negative or positive. 


L-1 2 


Energy: Sy = PAQ) (10) 


c=0 


The energy is important to examined the (repeated transitions) which represent power content in a determine 
frequency band. 


L-1 


Entropy: Se = ) pc) loga{p(o)} (11) 


c=0 


By entropy feature, the randomness of the gray level distribution is represented. If gray levels are distributed 
randomly throughout the image then the entropy will be high [5], [17], [18]. 
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4. BUTTERFLY OPTIMIZATION ALGORITHM 
4.1. Butterfly 

Butterflies use the senses of smell, taste, hearing, sight to locate the source of food and to mate. 
These senses are of great importance, helping the butterfly lay eggs in the right locations, and helping them to 
migrate from one location to another, as well as helping them to avoid the predator. In a butterfly the most 
important sense is the sense of smell, from which the butterfly can scent the scent of nectar from long 
distances. Butterflies have sensory receptors distributed across the body of the butterfly, and use them to 
smell, such as antennas, legs. Butterflies use these receptors to explore mating and the food source. 
Chemoreceptor are referred to as these receptors, which are neurons on the butterfly corpus surface. It is the 
chemoreceptor that direct the butterflies to find the best mating partner for genetic line to continue. And the 
male butterflies will recognize the female butterflies, via the pheromone. And pheromone is the fragrance 
secretion which is released from the butterfly to cause particular reactions [19]. Butterflies can identify with 
high precision the source of the fragrance, as well as feel intensities fragrances and can separate the different. 
In BOA algorithm the butterflies are agents the search. The butterfly can produce a fitness-related fragrance, 
as the fitness of the butterfly varies from one position to another according to its movement. The fragrance 
spreads over long distances, and the other butterflies can detect it. Butterflies will then exchange personal 
details with the other butterflies in this situation. When the butterfly detects the fragrance from another 
butterfly, it heads towards the butterfly that emits the fragrance, this step in algorithm called the global search 
process. But if the butterfly can not feel the fragrance that surrounds it, then the butterfly moves randomly, 
this stage is called the local search stage in the algorithm. Butterflies have three properties, these are [20]: 
a. All butterflies release some fragrance to attract butterflies to one another. 
b. Random movement of each butterfly or move towards of best butterfly in the direction of the butterfly 

that produces more fragrance. 
c. The area of research is influenced by the landscape and this, in turn, would affect the butterfly's stimulus 
intensity level [21]-[23]. 


4.2. Fragrance movement of butterflies 

The algorithm of (BO) have own a basic characteristic that distinguish it from the other meta- 
heuristic algorithms, which is that each fragrance has a personal touch and unique smell, the fragrance (F) in 
the (BO) algorithm was identifying as a function of stimulated physical intensity as in [24, 25]: 


F =es* (12) 


F represent the fragrance magnitude perceived, i.e., the fragrance strength thatith butterfly 
perceived. sensory modality represent by e. AndSrepresent the intensity of stimulus, and a represent 
modality dependent exponent of power, that explains the absorption degree. BOA contained of two stage, 
that represented by the local search stage and global search stage. In global search stage, The butterfly is 
heading towards the best butterfly, that is this solution known as G*that represented by (13): 


Xf*1 = Xt + (le'vy(A) x G* — X$) x F; (13) 


Where Xfrepresent the position for ith butterfly at iteration t. In the current generationG”* represents the best 
solution.F;represent of the fragrance ofith butterfly, step size represented byA. Local search stage 
represented by (14) 


xed = xt y (Je'vy(2) xxi — xf) x F; (14) 


Xt and Xx} represent kth and jth butterflies from solution space. Ais step size, then (13) becomes a local 
random walk. To control the global and local search in BOA the probability of switching (p) is used because 
the search for food can occur globally and locally level. And algorithm 1 represent pseudo code of the BOA 
[23], [26]. The general flowchart of standard BOA [25] shown in Figure 2. 

The algorithm 1. BOA 








F(X),X = (Xa X2 naedecasy XaimO ) is objective function, dim = Dimensions number 
The initial population (popul) of N butterflies are generated popul; = 
G@= 12.0, N) 


determined intensity of stimulus $; at ispopul;jby the fitness value f(popul;) 
Defining power exponent a , sensor modalitye, pswitch probability 
while ( t(max_g en) do 
in the population for each butterfly BF 
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for each butterflies calculate the fragrance by using equation (12) 
end for loop 
Find the Best BF 
For each BF in colony 
Randomize number rand is generated from [0,1] 
If rand(p then 
Moving towards the best solution, apply global search by Eq. 13 
else 
randomly move, apply local search using Eq.14 
end if statement, 
end for loop 
Adjustment value a of the power exponent 
end while 
Producing the best solution and the optimal value. 











Initialize the parameters (no. of butterfly (search agents), no. 
of generations maxGen, dimension) 





Generate the initial population (random butterflies) 








Define power exponent a, sensory modality e, switch probability p 
Initialize to 0 









no 


Move butterflies around 


Calculate the fragrance for each butterflies 


Select best solution 







Output best solution found, best 
objective function 





Update butterflies , update sensory 


modality e, increment t 


Figure 2. Flowchart BOA 


In 2016, the researchers (Sankalap Arora, and Satvir Singh) modified the optimization butterfly 
algorithm by using a random number instead of a lèvy flight. And equations of the two global and local 
search butterfly algorithm became as in [27]: 

Global search Equation is 


Xt = Xt + (r? x G*— X$) xF; (15) 
And the (local search stage) represented as in (16): 
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Xft" = Xi + (r? x X$ — Xf) x Fj (16) 


r represent number selected from [0,1] 

In this paper, the Euclidean distance between the characteristics of the x-ray images and the centers 
of the clusters was extracted as a fitness function to calculate the S value, which is represented intensity of 
stimulus in the equation for the calculation of fragrance magnitude F in order to obtain the best detection or 
classification of x-ray images. 





DimFeature 
S= ` (dataz — cluscentjz) (17) 


Z=1 


Where the data represents the x-ray image data represented by the feature extracted using the gray level 
distribution moments (GLDM) method and the cluscent represent the center of the cluster. 


5. PERFORMANCE MEASURES 
In this work, two indicators were used to measure the accuracy of the methods used. Which are the 
classification rate and detection rate as shown in [5], [6]: 


ee number of images classified correctly 
Classification rate = - — x 100 (18) 
number of images used for training 





, number of correctly det e cted images 
Detection_rate = - x 100 (19) 
total number of images 





6. EXPERIMENTS RESULTS 

In this paper, several types of butterfly algorithms were implemented to detect pneumonia based on 
x-ray images of the chest. To extract features from these images, we used the GLDM algorithm, and we used 
the meta-heuristic algorithm to detect and classify pneumonia. Such as the standard butterfly optimization 
algorithm that using a lèvy flight (BOALF) for a local and global search and the modified butterfly 
optimization algorithm using random number (BOARN) in local and global search. And we combined fuzzy 
membership function with standard BO algorithm and got a novel method called the fuzzy butterfly 
optimization algorithm (FBOA) that was proposed. The first step in implementing a model to detect 
pneumonia on x-ray images is to collect image data that will consist of training sets and model testing. 
Dataset available for download was used on the kaggle website. Kaggle provides users a wide variety of 
different datasets which can be used for different research purposes. Dataset was used of the pneumonia CXR 
images, this dataset consisted of normal person CXR images, and images of chest x-ray representing viral 
and bacterial pneumonia. Figure 3 shows the pneumonia detection system block diagram. 












Dataset features extraction Gray Dataset 


Chest X-Ray Level Distribution images 
(CXR)images Moments(GLDM) features 









Output: detect and classify meta-Heuristic methods 
pneumonia j (BOALF,BOARN,FBOA) 


Figure 3. Block diagram of a pneumonia detection system 


Training and testing dataset consist of (1607) CXR images with two category of CXR images, it is 
normal and pneumonia. Category pneumonia is two kind which is viral pneumonia, and bacterial pneumonia. 
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We used 1167 images for training phase, 440 images in test phase. Figures 4 to 6 show the samples of chest 
x-ray images of normal persons, samples of chest x-ray images of pneumonia bacterial of persons, samples of 
chest x-ray images pneumonia virus of persons, respectively. 


EDL. 
a. pHi 


PN Fe Pe 
w he ipi pra ab 





Figure 4. Samples of CXR Figure 5. Samples of Figure 6. Samples of CXR 
normal CXR pneumonia bacterial pneumonia virus 


At first, we were used GLDM algorithm to extract features from all images. Powerful features, and 
image data reduction. And those features were then normalized. We conducted numerous experiments for 
system evaluation, and also to investigate the effect of the methods parameters used (BOALF, BOARN, 
FBOA). The number of CXR images in each category is shown in Table 1. 


Table 1. Number of (CXR) images in each category 











Cx Rimage ye Training dataset Testing dataset 
No. of images Rate% No. of images Rate% 
Normal 493 42.245 54 12.273 
Bacterial Pneumonia 328 28.106 219 49.773 
Viral Pneumonia 346 29.649 167 37.54 
Total 1167 100 440 100 





In this work we have applied BOALF, BOARN, and the novel method FBOA methods to the dataset 
of chest x-ray images. These methods classified the training data set containing (1167) images in the training 
stage and obtained a classification rate equal to 100%. And the test dataset contained (440) x-ray images of 
the chest, and applied these methods to this dataset. In the testing stage, the novel method FBOA method was 
better than BOALF, BOARN methods, because it is obtained larger detection rate from other methods 
(BOALF, BOARN). Table 2 shows the test stage results for the three categories for chest x-ray images, 
normal, virus pneumonia and bacterial pneumonia. And Figure 7 shows the relation between BOALF, 
BOARN, FBOA algorithms with detection rate. 


Table 2. Results of testing stage using BOALF, BOARN, FBOA for three category 








Images type Detection rate of BOALF Detection rate of BOARN Detection rate of FBOA 
Normal 94.5 97.01 99.21 
bacterial Pneumonia 89.3 91.13 93.99 
viral Pneumonia 92.1 92.22 95.8 
average 91.96 93.45 96.33 





Also, when these methods were applied in the test-stage for two of chest x-ray images normal and 
abnormal categories, the new FBOA method was better than the BOALF, BOARN methods, because FBOA 
had got large detection rate (97.62), whereas BOARN obtained the detection rate (95.83) and the BOALF 
method detection rate equal (92.96). Table 3 shows the test results for the two categories normal and 
abnormal of the chest x-ray images for pneumonia disease. Figure 8 shows the relationship between BOALF, 
BOARN, FBOA algorithms with detection rate for two category normal and abnormal in test stage. 

The proposed new method obtained the highest pneumonia detection and classification accuracy 
compared to previous work as well, for pneumonia detection, researcher M. Z. C. Azemin et al. Obtained 
accuracy 71.9 %, and D. S. Kermany et al. obtained accuracy 95.5%. and also the researcher V. Chouhan et al. 
Obtained accuracy 96.3%, and as well as S. H. Yoo et al, obtained accuracy 95%, while the new proposed 
method FBOA in this work obtained a highest accuracy 97.62 % compared to these previous works. And also 
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In classifying pneumonia diseases, researcher N. Wang ef al. Obtained accuracy 96.1%, while the proposed 
new method FBOA obtained higher accuracy of classification equal to (96.33). 
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Figure 7. Relationship between algorithms BOALF, BOARN, FBOA with detection rate for three category 


Table 3. Results of testing stage using BOALF, BOARN, FBOA for two category 




















Images type _ Detection rate of BOALF Detection rate of BOARN Detection rate of FBOA 
Normal 95.4 97.3 99.5 
Pneumonia 90.52 94.36 95.74 
Average 92.96 95.83 97.62 
105 
100 : 

D Normal 
= 95 DO Pneumonia 
= O average 
3 90 : 
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a 
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BOALF BOARN FBOA 
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Figure 8. Relationship between algorithm BOALF, BOARN, FBOA with detection rate for two category 


7. CONCLUSION 

In this work, meta-heuristic methods representing various types of butterfly algorithms were 
implemented to detect pneumonia based on chest x-ray images. We used GLDM algorithm for extracted 
powerful features to facilitate image manipulation and we used the best meta-heuristic algorithms, the 
(BOALF). And modified algorithm of the butterfly (BOARN), And by combining the fuzzy membership 
function with the butterfly algorithm, we improved that algorithm and we got a novel improved method 
called FBOA. And applied these methods to dataset of chest x-ray images. And the new method FBOA got 
the highest rate of detection among the methods used in this system. Also, the proposed new method obtained 
the highest accuracy of detection and classification compared to previous works as well. 
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